Back

Human Molecular Genetics

Oxford University Press (OUP)

Preprints posted in the last 7 days, ranked by how well they match Human Molecular Genetics's content profile, based on 130 papers previously published here. The average preprint has a 0.13% match score for this journal, so anything above that is already an above-average fit.

1
Investigating the Y chromosome in complex disease: Phenome-wide scan across 104,334 Finnish men

Preussner, A.; Leinonen, J. T.; FinnGen, ; Pirinen, M.; Tukiainen, T.

2026-06-10 genetic and genomic medicine 10.64898/2026.06.09.26355235 medRxiv
Top 0.6%
3.9%
Show abstract

Although the Y chromosome represents roughly 2% of the male genome, it is often ignored in genome-wide association studies (GWAS). Subsequently, the potential health impacts of Y-chromosomal genetic variation remain incompletely understood. To fill this gap, we performed a phenome-wide association study (PheWAS) in FinnGen across 1,426 binary and quantitative traits using Y-chromosomal variation (frequency [&ge;] 1%) in 104,334 genotyped men. As Y chromosome variation is prone to population stratification, we performed carefully adjusted association analyses and further examined these through kin-based validation in 19,275 female and 24,712 male 1st degree relatives. We found 121 suggestive (p < 5.6x10-3) phenotypic associations in the Y chromosome, yet none of these were strong enough to reach phenome-wide significance (p < 3.9x10-6). While only 38 associations were supported in the kin-based validation, intriguingly we found support for a previously suggested link between haplogroup I1 and coronary heart disease (CHD; OR=1.06, 95%CI=1.02-1.11, p=3.7x10-3; male validation OR=1.05; female validation OR=0.97). The I1-CHD association was detected across distinct geographical areas within Finland and was independent from Loss of Y (LOY) and the autosomal risk to CHD, proposing a link between germline Y-chromosomal variation and heart disease risk. Overall, this study presents a comprehensive phenome-wide analysis of Y-chromosomal associations, highlighting the potential relevance of Y-chromosomal variation beyond sex determination. Our findings further emphasize the need for improved capture of Y-chromosomal variants and further analyses in biobank-scale data to allow for deeper exploration of male-specific genetic architecture of complex diseases.

2
Structured Patterns of Muscle Involvement in CAV3-Related Myopathy Revealed by Whole-Body CT Imaging

De Los Reyes, F. V. A.; Hayashi, S.; Saito, Y.; Ogawa, M.; Oya, Y.; Noguchi, S.; Nishino, I.

2026-06-04 radiology and imaging 10.64898/2026.06.03.26354504 medRxiv
Top 0.9%
2.8%
Show abstract

Caveolinopathies caused by CAV3 mutations present with heterogeneous clinical phenotypes ranging from asymptomatic hyperCKemia to limb-girdle-type muscular dystrophy. Although prior imaging studies have described commonly affected muscles, structured modeling of muscle involvement patterns in caveolinopathy has not been established. We analyzed whole-body skeletal muscle computed tomography imaging in eight patients with pathogenic or likely pathogenic CAV3 variants, comprising 14 imaging study samples. Fat infiltration across 43 muscles was graded using modified Mercuri scores. Computational multivariate analysis,including principal component analysis, clustering, and pseudotime modeling,was applied to characterize severity staging and distribution patterns. A statistically supported, stage-dependent continuum of muscle involvement was identified. Most samples demonstrated a distributed limb-girdle-predominant pattern with coordinated progression across muscle clusters. In contrast, one patient (three samples in longitudinal series) exhibited a compartment-restricted thigh-dominant pattern characterized by early posterior and medial thigh involvement. Rectus femoris showed consistent stage-dependent progression, while greater medial gastrocnemius involvement was associated with advanced severity. None of the patients exhibited clinical evidence of rippling muscle disease. These findings suggest that integrating semi-quantitative imaging with computational modeling may provide an objective framework for characterizing muscle involvement patterns in CAV3-related myopathy.

3
Human genetic evidence links serine biosynthesis to diabetic peripheral neuropathy

Fridman, V.; Kakar, A.; Jensen, A.; Van de Vondel, L.; Wheeler, A.; Phillips, L. S.; Zhou, J.; Zuchner, S.; Reusch, J.; Raghavan, S.

2026-06-10 genetic and genomic medicine 10.64898/2026.06.09.26355286 medRxiv
Top 1%
2.6%
Show abstract

Diabetic peripheral neuropathy (DPN) is a common and disabling condition for which no disease-modifying therapies are available. Glycemic and metabolic drivers do not fully explain why only a subset of individuals with diabetes develop DPN, and genetic contributors remain poorly defined. We aimed to perform a multi-population genome-wide association study (GWAS) of DPN to highlight potential new etiological pathways and therapeutic targets. Methods We performed a multi-population GWAS of neuropathy in people with and without diabetes using the VA Million Veteran Program and UK Biobank, followed by replication in the All of Us Research Program (AoU), and gene-based and gene-set analyses to identify implicated pathways. Causal relationships between circulating serine levels and DPN were further tested using two sample Mendelian randomization. To further evaluate pathogenic potential, we analyzed rare, high impact variants in GWAS implicated genes among individuals with unresolved inherited neuropathies using the GENESIS platform. Findings Among individuals with type 2 diabetes, we identified seven genome wide significant loci (p<5x10-): PHGDH and PSPH (key serine synthesis genes), TEAD1, CYP4F11, LARGE1, FTO, and COBLL1. No loci were significant in individuals without diabetes or with type 1 diabetes. Four loci (PHGDH, TEAD1, FTO and CYP4F11) replicated in AoU (p <0.05). Mendelian randomization demonstrated that higher genetically predicted serine levels were associated with lower DPN risk, consistent with a causal role of serine metabolism in disease pathogenesis. Rare-variant burden analyses revealed associations of predicted deleterious variants with inherited neuropathy case status in PHGDH (odds ratio [OR] 12.7 [95% CI 7.9, 20.4]), PSPH (OR 8.5 [7.2, 10.2]), PHKG1 (OR 4.8 [3.7, 6.3]), and LARGE1 (OR 0.007 [0.0004, 0.1]). Interpretation Convergent genetic evidence across common and rare variation implicates serine synthesis as a key pathway in DPN. These findings link diabetic and inherited neuropathies through a shared metabolic mechanism, identifying serine metabolism as a potential therapeutic target.

4
Rare neurological and neurodevelopmental variants in ALS link to onset, survival and family history

O'Donoghue, C.; Kacar, E.; Gomes, T.; Costello, E.; Pender, N.; Peelo, C.; Ryan, M.; Heverin, M.; Byrne, S.; Bede, P.; Hardiman, O.; McLaughlin, R. L.; Byrne, R. P.

2026-06-10 genetic and genomic medicine 10.64898/2026.06.09.26354977 medRxiv
Top 1%
2.4%
Show abstract

Background: Neurological, neuropsychiatric, and neurodevelopmental disorders cluster in ALS families, sharing a common genetic architecture with ALS. Pathogenic variants in genes associated with other neurological, neurodevelopmental, or neuropsychiatric disorders may also co-occur in ALS and modify phenotype. We have sought to determine the prevalence and clinical pattern of likely-pathogenic/pathogenic (LP/P) non-ALS neurological, neurodevelopmental, and neuropsychiatric variants, alone and in combination with ALS-gene variants, in two large ALS cohorts. Methods: Whole-genome sequencing (WGS) of 469 Irish and 774 Answer ALS people with ALS (pwALS) was analysed for ClinVar LP/P variants associated with other neurological (n = 15541), neurodevelopmental (n = 9761), and neuropsychiatric (n = 321) phenotypes. Inheritance patterns for associated genes (autosomal recessive/autosomal dominant) along with the associated phenotype were validated using OMIM. Standardised clinical data included family history, site and age of onset, El Escorial category, survival, motor decline, and cognitive and behavioural assessments. Known ALS-gene variants and C9orf72 repeat expansion status were included for each cohort. Results: Non-ALS neurological variants were identified in 47/469 (10.0%) Irish and 69/774 (8.9%) Answer ALS participants, most frequently in hereditary spastic paraplegia-associated genes (3.2% Irish; 2.8% Answer ALS). Irish neurological variant carriers showed higher frequency of respiratory onset (10.6% vs 1.2%, Fisher's exact p = 0.002, {Phi} = 0.20) and fewer premorbid behavioural symptoms (0.92 +/- 0.56 vs 3.08 +/- 0.97, Cohen's d = -0.40). Neurodevelopmental variants occurred in 12/469 (2.6%) Irish and 20/774 (2.6%) Answer ALS participants. In the Irish cohort, neurodevelopmental variant carriers had significantly shorter survival in Cox proportional hazards model (log-rank p = 0.005), corresponding to a more than two-fold increased hazard of death (HR = 2.25, 95% CI 1.26-4.00), and had significantly increased familial burden of neuropsychiatric disorders among first- and second-degree relatives (negative binomial IRR for carriers = 2.41, 95% CI: 1.12-5.18, p = 0.025). Across combined cohorts, 18 individuals (Irish n = 8; Answer ALS n = 10) carried [&ge;]2 LP/P variants spanning ALS and non-ALS genes. Conclusion: Rare LP/P variants in genes associated with other neurological and neurodevelopmental disorders occur in up to 12% of pwALS across two independent cohorts. Carriers show distinct phenotypes, shorter survival, and characteristic family history patterns. These findings suggest that extended pleiotropic and oligogenic architectures may contribute to ALS heterogeneity.

5
Breast cancer polygenic risk score performance varies by socioeconomic status

Domian, H. I.; Tian, X.; Ong, D.; Hamilton, L.; Shieh, Y.; Musharoff, S. A.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.03.26354819 medRxiv
Top 1%
2.1%
Show abstract

Background: Polygenic risk scores (PRS) for breast cancer are increasingly used for risk stratification to inform screening and prevention. However, for PRSs to be equitable and clinically useful, they need to perform well across diverse populations. While PRS performance is known to be ancestry-dependent, it is not well understood how environmental context, such as that of socioeconomic status (SES), affects PRS transferability. Here, we assess whether SES, measured via self-reported household income, modifies breast cancer PRS performance and, if so, whether socioeconomic context contributes predictive information beyond genetic risk alone. Methods: We used the US-based All of Us biobank to evaluate how SES impacts breast cancer PRS performance. First, we quantified changes in breast cancer PRS performance by modeling a commonly-cited polygenic score for breast cancer previously described by Mavaddat et al. with SES. We then reestimated the genetic effect sizes of the 3,820 variants from Mavaddat et al. in All of Us with and without income as a covariate. Because social determinants of health affect breast cancer detection and outcomes, we stratified analyses by socially defined populations on the basis of self-identified race and ethnicity. We further stratified individuals whose self-identified race is White (''White'') into three SES groups (high, middle, low) based on self-reported income and re-estimated genetic effect sizes to create SES-specific PRSs. We then applied these PRSs to White participants, the largest group in the study, and to Black or African American (''Black'') and Hispanic or Latino (''Hispanic'') participants, groups underrepresented in breast cancer research. Model discrimination between cases and controls was measured by area under the curve (AUC). Results: We analyzed 163,715 women from the All of Us biobank, which included 8,833 breast cancer cases (6,619 White, 1,178 Black, and 1,036 Hispanic), with relative income available for a subset of these cases (5,525 White, 848 Black, and 566 Hispanic). The ancestry-dependent performance of the breast cancer PRS described in Mavaddat et al. was replicated in All of Us. In Black individuals, this PRS (AUC and 95% CI: 0.576 [0.571, 0.582]) produced a similar increase in AUC as relative income (AUC: 0.573 [0.568, 0.577]) when added to an age-only model. Incorporating income with PRS, age, and genetic PCs 1-3 improved AUC by 0.007 in White Americans and 0.018 in Black Americans (both p < 10-11), while attenuating the contribution of PRS in the full model. PRS performance also varied among SES categories. Notably, PRSs with variant effect sizes that were recalibrated in low-SES White participants performed best in low-SES White participants (AUC: 0.605 [0.583, 0.628]) and Black Americans (AUC: 0.588 [0.586, 0.591]), both better than performance in high-SES White Americans (AUC: 0.579 [0.577, 0.580]) and middle-SES White Americans (AUC: 0.578 [0.569, 0.586]). Conclusion: Socioeconomic context, measured by income, significantly impacts the transferability of a PRS for breast cancer within and among groups defined by self-identified race and ethnicity. Accounting for SES improves PRS performance, most notably in Black Americans and low-SES White individuals.

6
Heterozygous MMACHC burden variants are associated with higher circulating vitamin B12 in the All of Us Research Program

Cai, L.; DeBerardinis, R. J.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.03.26354855 medRxiv
Top 1%
1.9%
Show abstract

Heterozygous carriers of autosomal recessive disease variants are conventionally considered unaffected, yet population-scale genomic datasets reveal subclinical carrier phenotypes. MMACHC encodes a cobalamin-processing protein whose biallelic loss causes cobalamin C deficiency, an inborn error of intracellular cobalamin metabolism. We performed an unbiased quantitative phenome-wide association screen in All of Us Research Program v8 to identify phenotypes associated with rare heterozygous MMACHC burden variants. Serum/plasma vitamin B12 was the top quantitative association. Carriers had higher circulating B12 than non-carriers in adjusted analyses, but also higher homocysteine, suggesting that elevated circulating B12 does not reflect improved intracellular cobalamin function. Carriers were less likely to fall below conventional B12 insufficiency thresholds, indicating a potential diagnostic blind spot. A pathway-wide rare-variant gene-burden (All-by-All) gene-burden analysis placed this finding in broader biological context. Burdens in genes related to circulating B12 binding or intestinal absorption were associated with lower circulating B12. In contrast, burdens in several genes involved in cellular delivery and intracellular cobalamin handling were associated with higher circulating B12. This step-specific directionality supports a model in which elevated circulating B12 can reflect impaired cellular handling and consequent systemic accumulation rather than improved cellular cobalamin availability. Because EHR-derived B12 is shaped by heterogeneous clinical and medication contexts, prospective carrier-enriched studies with standardized methylmalonic acid, homocysteine, diet, supplement, medication, comorbidity, and symptom ascertainment are needed to evaluate functional-marker-based screening.

7
Contextualizing the Utility of Polygenic Risk Scores using Absolute Risk Models in Diverse Ancestry Populations

Chatterjee, N.; Martina, F.; Kachuri, L.; Natarajan, P.; Witte, J.; Huo, D.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.03.26354842 medRxiv
Top 2%
1.6%
Show abstract

Polygenic risk scores (PRSs) are emerging as powerful tools for quantifying inherited risk for common diseases and, in some cases, are approaching clinical implementation. A major concern for PRS implementation is their limited accuracy in non-European populations, particularly in those of African ancestry. However, past evaluations have focused on metrics such as relative risk or AUC, which do not capture background risk arising from contextual factors. We introduce a novel measure of variable importance, the conditional average derivative estimator (CADE), to evaluate PRS utility across diverse contexts and populations within absolute risk models that integrate PRSs with other relevant risk factors. We illustrate this framework by integrating PRSs for breast and prostate cancer within age-specific absolute risk models for incidence and mortality fit using individual-level data from the All of Us Research Program with inputs from the National Cancer Institute SEER cancer registry. Our projections show that although the PRSs are known to have the lowest discriminatory accuracy in African Americans (AA), there are contexts in which they provide greater utility, such as for the stratification of prostate cancer risk and mortality, where the CADE values for AA were 2- and 7-fold higher than for European Americans. These findings suggest that conclusions about the limited clinical utility of PRS in non-European populations may be premature and underscore the need to quantify PRS risk-stratification utility at the absolute-risk level, while accounting for disease onset, survival, and broader health and economic factors.

8
Natural History of Prenatally Identified Children with 48,XXYY Syndrome in Infancy and Early Childhood

Nocon, K.; Swenson, K.; Bothwell, S.; Howell, S.; Davis, S.; Ikomi, C.; Ross, J.; Tartaglia, N.

2026-06-04 pediatrics 10.64898/2026.06.04.26353909 medRxiv
Top 2%
1.4%
Show abstract

Background: 48,XXYY syndrome is a rare sex chromosome aneuploidy (SCA) characterized by neurodevelopmental deficits and medical comorbidities. The limited information available in the literature is almost exclusively limited to postnatally diagnosed cases. This study aims to describe the early medical and developmental features of prenatally identified 48,XXYY infants, with comparisons to 47,XYY, 47,XXY cohorts, and typical populations, as well as previously reported postnatally diagnosed 48,XXYY cases. Methods: The eXtraordinarY Babies Study prospectively follows children prenatally identified to be at high risk for SCA with annual medical and neurodevelopmental evaluations. Data presented herein include the prevalence of medical conditions, developmental milestones, developmental and adaptive functioning assessment scores, and therapy utilization in participants confirmed to have 48,XXYY. Comparisons were made between this cohort and the typical population, infants with 47,XYY and 47,XXY also enrolled in the eXtraordinarY Babies Study, and a 2008 cohort of individuals postnatally identified 48,XXYY. Results: Infants with 48,XXYY exhibited a range of early medical features, including high rates of feeding and GI disorders (breastfeeding difficulties, gastroesophageal reflux, and eosinophilic esophagitis), allergic disorders (food allergies and environmental allergies), and hypotonia. Developmental and adaptive functioning scores indicated delays in motor, communication, and social domains, with nearly all infants receiving speech therapy, physical and/or occupational therapy. Comparisons with the 47,XYY and 47,XXY cohorts revealed more medical and developmental challenges in the 48,XXYY group, however there was variability and some overlap with both the general population and sex chromosome trisomy conditions. Additionally, comparison to the 2008 postnatally identified 48,XXYY cohort indicated that while prenatal diagnosis allowed for earlier intervention, developmental outcomes in the first years of life were similar between the two groups. Conclusions: 48,XXYY diagnosed prenatally facilitates early monitoring, anticipatory guidance, and proactive referrals for medical evaluations and intervention, given developmental delays and medical challenges are more common in infancy and early childhood compared to the general population and trisomy SCAs. These findings provide valuable insights for genetic counselors and healthcare providers, emphasizing the spectrum of medical and developmental findings and importance of early and proactive care to support individual outcomes. Prospective study of this prenatally identified cohort will provide important natural history and phenotypic variability in XXYY, as well as identification of predictors of health and developmental outcomes.

9
Whole-exome-based preconception carrier screening in Uzbekistan with targeted SMA, FMR1, and DMD assays: the first reported clinical program

Kullyev, A.; Avdeichik, S.; Akimenkova, A.; Kartuesov, A.; Kardymon, O.; Goikhman, Y.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.02.26354713 medRxiv
Top 2%
1.4%
Show abstract

Abstract Purpose: Published clinical outcome data on preconception carrier screening (PCS) in Central Asia are limited. We report the first clinical implementation study from Uzbekistan of a whole-exome sequencing (WES)-based multi-platform PCS program combining exome sequencing with targeted SMA, FMR1, and DMD assays. Methods: We retrospectively analyzed anonymized data from 65 individuals (19 couples, 27 singletons) screened at IMC Genomics, Tashkent, between January 2024 and May 2026. WES covering the protein-coding regions of approximately 20,000 genes was followed by exome-wide bioinformatics filtering and clinical geneticist interpretation. Partly overlapping cohorts underwent SMA carrier screening (n=179), FMR1 CGG-repeat analysis in females (n=155), and DMD deletion/duplication testing in preconception females (n=29). Variants were classified by ACMG/AMP criteria against gnomAD v4.1. Results: Sixty-one of 65 WES-screened individuals (93.8%; 95% CI 85.2 - 97.6%) carried at least one reportable variant (152 instances across 126 genes). Four of 19 couples (21.1%; 95% CI 8.5 - 43.3%) were concordant for pathogenic or likely pathogenic variants in the same autosomal recessive gene; two were referred for preimplantation genetic testing for monogenic disease. SMA screening identified four carriers, including two 2+0 silent carriers; FMR1 analysis identified one intermediate allele; DMD MLPA identified no exonic rearrangements. Conclusion: This first reported WES-based multi-platform PCS program in Uzbekistan was feasible and clinically informative, identifying actionable couple-level reproductive risks and supporting structured implementation of reproductive genetic screening in Central Asia.

10
Phenome-wide association of multiallelic copy number variation in 422,170 UK Biobank individuals reveals novel genetic loci associated with disease

Eisenberg, M.; Packer, R.; Shrine, N.; Demidov, G.; Pack, H.; Hollox, E. J.; Fawcett, K.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.03.26354825 medRxiv
Top 2%
1.3%
Show abstract

The contribution of multi-allelic CNVs (mCNVs) to disease risk has not been widely studied. This is largely because they have been difficult to characterise at a large-scale genome-wide, and are often not strongly associated with flanking SNVs, limiting imputation. Improved understanding of the role of mCNVs in disease risk could lead to novel insights into the pathobiology of disease. We robustly typed 69 mCNVs from UK Biobank whole exome sequences in discovery (n=150,682) and replication sets (n=269,317). Discovery and replication PheWAS used clinically-curated composite phenotypes by integrating self-report, primary and secondary health care data to interrogate these variants, for unrelated British individuals of African, European and Central/South Asian ancestries. 173 mCNV-phenotype associations were detected from 26 mCNVs, of which 114 associations replicated. One of eight potentially novel mCNV-phenotype signals was independent of neighbouring associated SNVs, the association of Sulfotransferase 1A1 and 1A2 genes (SULT1A1/SULT1A2) with estimated glomerular filtration rate (eGFR) in individuals of European ancestry (meta-analysed p=1.05x10-9, beta=0.016 [0.011; 0.021]). Other potentially novel associations include Golgi phosphoprotein 3 (GOLPH3) with the cardiovascular phenotype bundle branch block in individuals of South Asian ancestry (meta-analysed p=3.35x10-6, OR=2.13 [1.53, 2.96]) and alpha amylase 2B (AMY2B) with ventricular fibrillation and flutter in individuals of European ancestry (meta-analysed p=2.48x10-6, OR=1.50 [1.26; 1.78]). In summary, we show that accurate typing of biobank-scale sample sizes can identify associations between traits and mCNVs, acting through a gene dosage relationship. Our work provides several novel likely causative variants contributing to particular traits of clinical importance and immediately suggest a putative functional mechanism for the observed associations.

11
Association of body composition, daily physical activity and handgrip strength with mortality, cardiovascular events and cancers in Japanese patients with diabetes

Hamasaki, H.

2026-06-10 endocrinology 10.64898/2026.06.09.26355239 medRxiv
Top 2%
1.3%
Show abstract

Aims: Sarcopenia and sarcopenic obesity are associated with increased risks of cardiovascular (CV) disease and mortality. This study examined the associations of body composition and daily physical activity with mortality, CV events and cancer in patients with diabetes. Methods: This prospective cohort study included patients with diabetes treated at a specialised clinic in Japan between January 2018 and March 2023. Body composition, including visceral adipose tissue (VAT), was assessed by bioelectrical impedance analysis. Daily physical activity was evaluated using the non-exercise activity thermogenesis (NEAT) questionnaire, and handgrip strength (HGS) was measured by dynamometry. Cox proportional hazards models were used to assess associations with mortality, CV events, and cancer. Results: Among 2,024 patients (mean age 63.0 years, BMI 24.6 kg/m^2, HbA1c 7.8%), NEAT, HGS, and VAT were not independently associated with all-cause mortality. Higher VAT was associated with increased cancer risk (HR 1.485; 95% CI 1.101-2.003; p = 0.009). Higher HGS was inversely associated with CV event risk (HR 0.951; 95% CI 0.919-0.984; p = 0.004). NEAT was not associated with any outcome. Conclusions: Higher VAT was associated with increased cancer risk, whereas higher HGS was protective against CV events. Incorporating body composition and HGS assessments into clinical practice may improve risk stratification and management in patients with diabetes.

12
Physical activity, fatty acids, and MASLD risk: Behavioural and metabolic factors jointly shaping liver health in populations

Chen, F.; You, R.; Liu, Y.; Yin, Y.; Liu, A.; Deng, L.; Xie, B.; Fan, J.; Wang, W.

2026-06-08 epidemiology 10.64898/2026.06.05.26354982 medRxiv
Top 2%
1.2%
Show abstract

Background and Aims: MASLD has become the most prevalent chronic liver disease globally. Although MVPA and plasma fatty acids have been individually studied in relation to metabolic health, their independent and combined associations with MASLD incidence remain unclear. We aimed to investigate these associations. Methods: This study included 51,717 UK Biobank participants free of liver disease at baseline, with MVPA measured using wrist-worn accelerometers and plasma fatty acids quantified via NMR. Multivariable-adjusted Cox models and restricted cubic splines were used. Results: Over a median follow-up of 7.8 years, 472 incident cases were identified. In fully adjusted models, meeting recommended MVPA levels together with higher n-6 PUFA concentrations was associated with a 71% lower risk (HR 0.29, 95% CI 0.18-0.45). The MVPA-MASLD association was nonlinear, with risk reduction plateauing at approximately 189 minutes per week. Higher n-6 PUFA was associated with reduced risk, whereas n-3 PUFA showed no significant association. Conclusions: These findings suggest that behavioral and metabolic factors may jointly influence MASLD risk. Further studies in diverse populations are needed to confirm these associations.

13
Low-Dose Aspirin Adherence Following Objective cell-free RNA-Based Preeclampsia Risk Testing: A Real-World Survey Study

Moe, A. B.; Haverty, C.; Lee, M.; Hahn, S. E.; McElrath, T. F.; Jain, M.; Rasmussen, M.; Corso, A.; Larson, M. L.; Morrison, H.; Melroy, L. M.; Roofeh, J.; Phelps-Sandall, B.; Kiefer, D.; Biggio, J. R.

2026-06-10 obstetrics and gynecology 10.64898/2026.06.08.26355195 medRxiv
Top 2%
1.2%
Show abstract

Introduction: Preeclampsia (PE) is a leading cause of maternal and neonatal morbidity and mortality, and low-dose aspirin (LDA) prophylaxis is the cornerstone of evidence-based prevention. Despite guideline recommendations, LDA adherence remains poor, with 10-25% of moderate-risk patients taking aspirin. Objective personalized risk stratification using biomarkers has been shown to motivate behavior change in other disease contexts. Survey data suggest that patients are more motivated to take aspirin if informed by an objective predictive test. Here, we report real-world LDA adherence among patients who received a high-risk result from a cell-free RNA (cfRNA) PE risk prediction test. Methods: This retrospective, observational survey study included asymptomatic patients of advanced maternal age (AMA; [&ge;] 35 years at delivery) with singleton pregnancies without USPSTF-defined preexisting high-risk conditions for PE who received the cfRNA PE risk prediction test. Patients who opted in to receive text message surveys were asked about LDA use following receipt of test results. High adherence was defined as reporting LDA use on at least 6 of 7 days per week at least 85% of the time surveyed. The primary analysis included patients with a high-risk test result and at least one LDA frequency survey response following receipt of test result. The observed proportion of adherent patients was compared to a baseline estimate of 25% using an exact binomial test. Results: Of 166 patients who received a cfRNA PE risk prediction test result, 48 (28.9%) received a high-risk result. Of these, 29 (60%) opted in and responded to at least one survey, constituting the primary analysis population. Twenty-seven of the 29 (93.1%; 95% CI: 78.0-98.1%) were classified as highly adherent, significantly higher than the 25% baseline adherence estimate for moderate-risk patients (p < 0.0001). Conclusion: Among surveyed patients who received a high-risk cfRNA PE test result, the proportion classified as highly adherent to LDA (93%) substantially exceeded published estimates of adherence in a similar patient population and met the clinically meaningful threshold of [&ge;] 80% associated with reduced risk of preterm preeclampsia. These findings indicate that objective and personalized biomarker risk testing may be a powerful driver of behavior change that current guidelines have failed to produce.

14
An integrated proteogenomic investigation of the human liver uncovers molecular drivers of steatotic liver disease

Gobeil, E.; Bourgault, J.; Enault, M.; Cote, V.; Mitchell, P. L.; Ruel, L.-J.; Girard, A. S.; Vohl, M.-C.; Arsenault, B. J.

2026-06-06 endocrinology 10.64898/2026.06.04.26354903 medRxiv
Top 2%
1.2%
Show abstract

Metabolic dysfunction-associated steatotic liver disease (MASLD) is rapidly increasing worldwide, yet effective targeted therapies remain limited. To better understand the molecular mechanisms underlying MASLD, we performed an integrated proteogenomic analysis of human liver tissue. Using mass spectrometry, we quantified 2,744 proteins in 504 liver biopsies from the Quebec Obesity Biobank and examined changes across disease stages. To investigate causality, we integrated liver proteomics with RNA sequencing and genome-wide genotyping to map thousands of protein quantitative trait loci (pQTLs) and expression quantitative trait loci (eQTLs). These molecular data were combined with summary statistics from a meta-analysis of genome-wide association studies including 16,532 MASLD cases and 1,240,188 controls. Mendelian randomization and genetic colocalization analyses revealed that most proteins differentially expressed across MASLD stages were not causally implicated in disease risk, whereas several genetically predicted liver proteins showed evidence of causal effects. Among these, higher hepatic levels of the MTARC1 protein were causally associated with MASLD and hepatic fat accumulation. Phenome-wide analyses suggested that MTARC1 inhibition may reduce the risk of cirrhosis, hepatocellular carcinoma, and cholelithiasis while improving lipid profiles. Notably, the causal MTARC1 variant influenced liver protein levels but not gene expression. Genetic analyses also identified ERLIN1 and HSD17B13 as potential therapeutic targets. In contrast, eQTLs and pQTLs at other loci such as GCKR showed opposite effects on MASLD risk. These findings highlight the importance of integrating tissue proteomics with human genetics to distinguish biomarkers from causal drivers and to identify promising therapeutic targets for MASLD.

15
Clonal Hematopoiesis of Indeterminate Potential Refines Cardiovascular Risk Stratification in Cardiovascular-Kidney-Metabolic Syndrome Stages 0-3

Lu, J.; Sun, S.; Deng, Z.; Wang, S.; Wei, C.; Jiang, S.; Li, W.

2026-06-08 epidemiology 10.64898/2026.06.04.26354963 medRxiv
Top 2%
1.1%
Show abstract

Background: Chronic low-grade inflammation drives cardiovascular-kidney-metabolic (CKM) syndrome. Clonal hematopoiesis of indeterminate potential (CHIP), an age-related driver of systemic inflammation, is linked to several cardiometabolic disorders. However, whether CHIP modifies CKM progression and contributes to heterogeneity in cardiovascular disease (CVD) risk within the CKM framework remains uninvestigated. Methods: This cohort study included 307,025 UK Biobank participants at CKM stages 0-3 free of baseline CVD. CHIP status was identified via whole-exome sequencing (WES). The association between CHIP and baseline CKM severity was examined, along with the independent and joint effects of CHIP and CKM stages on incident CVD risk. The joint effects of CHIP and polygenic risk scores (PRS) were further assessed, and the incremental predictive value of incorporating CHIP into the AHA PREVENT equations was evaluated. Results: CHIP carriers were more likely to present with advanced CKM stages [OR 1.14 (1.09-1.20), P < 0.001] and exhibited higher incident CVD risk during follow-up [HR 1.13 (1.08-1.18), P < 0.001]. Significant joint effects between CHIP and CKM stages were observed, with the highest risk among CHIP carriers at CKM stage 3 [HR 1.63 (1.50-1.78), P < 0.001]. Large or multiple CHIP mutations conferred greater hazards, with distinct gene-specific effects observed. Moreover, CHIP and high genetic risk also jointly amplified CVD susceptibility. Most importantly, incorporating CHIP into AHA PREVENT significantly improved risk discrimination. Conclusions: CHIP is a significant risk factor associated with more advanced CKM stages and amplifies incident CVD risk. Integrating CHIP into existing prevention strategies may refine CVD risk stratification.

16
Reprogramming of Iron and Oxygen Metabolism Across the Spectrum of Primary Aldosteronism

Parisien-La Salle, S.; Tsai, C. H.; Newman, A. J.; Heydarpour, M.; Mahrokhian, S.; Hanna, I.; Brown, J. M.; Waikar, S.; Moussa, M.; Vaidya, A.

2026-06-10 endocrinology 10.64898/2026.06.09.26355256 medRxiv
Top 3%
1.0%
Show abstract

Background: Pathologic aldosteronism induces oxidative stress, tissue injury, and increases in hemoglobin. Conversely, aldosterone antagonist therapy decreases hemoglobin. Whether these effects are attributable to aldosterone-mediated changes in iron and oxygen metabolism is unknown. Methods: The plasma proteome of participants with overt primary aldosteronism (PA) (n=50) was compared with participants without overt PA (n=61). To isolate aldosterone-dependent effects, participants without overt PA underwent oral sodium suppression testing to quantify the magnitude of renin-independent aldosterone production, enabling monotonic dose-response analyses across the continuum of renin-independent aldosteronism (subclinical to overt PA). Differential abundance testing was performed using empirical Bayes linear modeling, followed by Reactome pathway enrichment analysis and covariate-adjusted sensitivity analyses. To validate clinical relevance, aldosterone dose-response trends with blood count parameters were examined in this cohort, and an independent population-based cohort of 5,713 people with hypertension. Results: 903 proteins in the peripheral circulation were differentially abundant in overt PA versus participants without PA. The most significantly increased protein in overt PA was CYBRD1, involved in iron reduction and absorption. Pathway enrichment identified 16 iron- and heme-related pathways, including erythropoietin signaling, heme biosynthesis and mitochondrial iron-sulfur cluster biogenesis, with increases in heme and erythroid proteins and decreases in mitochondrial iron-sulfur proteins. Linear aldosterone dose-dependent trend analyses across the PA continuum further supported this signature, identifying progressive increases in hemoglobin subunits (HBA1/HBB), heme-related proteins (HMBS, UROS, AMBP, HPX, GLO1) and erythrocyte oxygen handling enzymes (CA1/CA3), alongside progressive reductions in mitochondrial electron transport chain subunits (CYCS, ETFA). These proteomic changes corresponded with aldosterone dose-dependent increases in red blood cell count, hemoglobin, and hematocrit, in this cohort and another population-based cohort. Conclusion: The continuum of PA is characterized by a progressive shift away from mitochondrial oxidative phosphorylation and toward increased intestinal iron absorption, preferential iron transport over storage, and enhanced heme synthesis and recycling, possibly reflecting cellular pseudohypoxia and systemic adaptations to increase oxygen delivery. These findings provide a novel mechanistic basis for aldosterone-mediated tissue injury and the benefits of aldosterone-directed therapy.

17
Liver biopsy confirms precise and efficient correction of SERPINA1 after in vivo Base Editing in a Patient with Alpha-1 Antitrypsin Deficiency

Krooss, S. A.; Yang, T.; Yuan, Q.; Drick, N.; Sgodda, M.; Held, J.; Behrendt, P.; Hartleben, B.; Koczulla, R.; Ma, X.; Liu, Y.; Wedemeyer, H.; Janciauskiene, S.; Di Donato, N.; Cantz, T.; Wang, E.; Wu, Y.; Hoeper, M.; Xia, Q.; Ott, M.

2026-06-09 genetic and genomic medicine 10.64898/2026.06.01.26354551 medRxiv
Top 3%
1.0%
Show abstract

Background: Alpha-1 antitrypsin deficiency (AATD) caused by the PI*ZZ mutation (Glu342Lys) results in hepatic accumulation of misfolded AAT-Z protein and reduced circulating AAT levels, leading to progressive liver disease and emphysema. Gene correction therapy represents a potentially curative approach by directly correcting the underlying genetic defect. We report the first case of successful hepatic gene correction with early histological and functional assessment. Methods/Case presentation: We report the case of a 66-year-old male patient with PI*ZZ AATD who underwent gene correction therapy within the YOLT-202 phase I/Ia clinical trial (clinical trial.gov ID NCT07193615). Ten weeks post treatment a liver biopsy was performed to re-evaluate pre-existing F2 liver fibrosis as measured by elastography before entering the study. Serum samples allowed functional assessment of the AAT-mediated elastase inhibition. Results: Liver biopsy did not show signs of hepatic inflammation and demonstrated 54% (Sanger) and 57% (Illumina) gene correction rate of the PI*ZZ variant on the DNA level with no bystander edits or off-target effects. Following a transient elevation of transaminases during the early post-treatment period, liver enzymes normalized. Monthly serum AAT measurements demonstrated biologically active and stable therapeutic levels throughout follow-up. Conclusions: This case demonstrates efficient and precise hepatic gene correction without concerning histological alterations and with substantial improvement of functional parameters, supporting the feasibility and safety of gene editing approaches for AATD.

18
HbF/F-cell and the Phenotype of Sickle Cell Disease

Wilks, A.; Lofters, J.; Lee, J.; Milton-Hicks, J.; Klings, E.; Steinberg, M.

2026-06-04 hematology 10.64898/2026.06.02.26354737 medRxiv
Top 3%
0.9%
Show abstract

Fetal hemoglobin (HbF) prevents the polymerization of sickle hemoglobin (HbS). HbF, measured usually as a percent of total hemoglobin (%HbF), is inversely associated with the severity of sickle cell disease (SCD) but fails to capture the distribution of HbF concentrations within red blood cells (RBCs). The relative proportion of HbF and HbS within a RBC is reflected by the HbF:HbS ratio whereas HbF/F-cell quantifies the absolute amount of HbF/RBC. While correlated, HbF:HbS ratio and HbF/F-cell are not interchangeable. In the context of mean corpuscular hemoglobin (MCH), HbF/F-cell approximates whether sufficient HbF is present to inhibit HbS polymerization. We examined the association of mean HbF/F-cell with sub-phenotypes of sickle cell disease in three independent cohorts. Both %HbF and HbF/F-cell were significantly associated with multiple clinical and laboratory features of SCD; however, HbF/F-cell demonstrated stronger associations with clinical severity measures across cohorts. Higher HbF/F-cell was associated with fewer clinical events, reduced hemolysis, and mortality. Changes in HbF/F-cell after hydroxyurea treatment were associated with ~11-13% reduction in acute events in patients with <1 pg increase and >60% reduction with a >5 pg increase in HbF/F-cell. For each pg increase in HbF/F-cell there was ~6% reduction in the rate of acute events. As a surrogate for the distribution of HbF concentrations among F-cells, HbF/F-cell adds physiologically relevant insights that could guide prognosis and treatment

19
Alcohol Consumption Patterns and Sociodemographic Correlates Among US Adults with Cardiovascular Disease: A Cross-Sectional Analysis of All of Us and NHANES

yang, q.; yu, j.; zhao, h.; zou, m.; sun, y.

2026-06-09 public and global health 10.64898/2026.06.06.26355052 medRxiv
Top 3%
0.9%
Show abstract

This cross-sectional study aimed to examine the prevalence of alcohol use and its sociodemographic correlates among adults with cardiovascular disease (CVD). We analyzed data from two large US cohorts: the All of Us Research Program (2017-2023) and the National Health and Nutrition Examination Survey (NHANES, 1999-2016). Both CVD diagnosis and past-year alcohol consumption were self-reported. Risky drinking was defined as exceeding moderate drinking or binge drinking (All of Us), or moderate/heavy drinking (NHANES). Multivariable logistic regression was used to exam associations with sociodemographic and lifestyle factors. Among 32,788 current drinkers with CVD in the All of Us cohort, 15% exceeded moderate drinking thresholds and 26% reported binge drinking. Older age, female sex, and higher socioeconomic status were inversely associated with risky drinking, while smoking was positively associated. In NHANES, moderate drinking rose from 47.3% to 57.2% and heavy drinking from 6.7% to 7.2%. Moderate/heavy drinking was positively associated with age <65 but inversely with age [&ge;]65. Higher education and income were linked to moderate drinking, while current smoking was strongly associated with heavy drinking. These results highlight the need to integrate holistic screening for alcohol use, tobacco use, and social context into routine cardiovascular care.

20
Exploring the role of binge eating in the association between ADHD and BMI: A twin study

YOU, Y.; McAdams, T.; Oginni, O.; Liu, C.; Herle, M.; Zavos, H.

2026-06-05 psychiatry and clinical psychology 10.64898/2026.05.28.26354354 medRxiv
Top 3%
0.8%
Show abstract

Objective: ADHD has been associated with obesity indicators, including BMI, across the lifespan. A possible mechanism linking ADHD and BMI is binge eating. Previous research has found associations between ADHD, binge eating and BMI. However, the role of genetic and environmental influences on these associations remains unclear. Method: We utilized data from the Twins Early Development Study (TEDS), comprising 3,675 monozygotic and 7,063 dizygotic twin pairs. ADHD symptoms in childhood and adolescence were assessed using parent-reported questionnaires. Adult ADHD symptoms were measured using both self-report and parent-report questionnaires. Phenotypic mediation models examined whether binge eating mediated the association between ADHD and BMI, without controlling for genetic confounding. Subsequently, the etiological architecture underlying the associations among the three traits across childhood, adolescence, and adulthood were investigated by incorporating genetic and environmental influences into the models. Results: Binge eating significantly mediated the association between ADHD symptoms and BMI in both adolescence and adulthood. However, these mediation effects were no longer present once genetic and environmental influences were incorporated into the models. The best-fitting model in childhood, adolescence and adulthood was Cholesky decomposition models, where covariance between traits was explained by shared aetiology. Conclusions: This twin study reveals shared liability across ADHD, binge eating, and BMI. The mediating role of binge eating in the relationship between ADHD symptoms and BMI was largely confounded by shared genetic influences. Intervention strategies could focus more on common underlying behavioural and self-regulatory mechanisms across these traits, as well as placing more emphasis on symptom patterns within families.